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Abstract 

Using a geometric argument, we show that under a reasonable continuity condition, the Clarke 
subdifferential of a semi-algebraic (or more generally stratifiable) directionally Lipschitzian function 
admits a simple form: the normal cone to the domain and limits of gradients generate the entire 
Clarke subdifferential. The characterization formula we obtain unifies various apparently disparate 
results that have appeared in the literature. Our techniques also yield a simplified proof that closed 
semialgebraic functions on R™ have a limiting subdifferential graph of uniform local dimension n. 

1 Introduction. 

Variational analysis, a subject that has been vigorously developing for the past 40 years, has proven 
itself to be extremely effective at describing nonsmooth phenomenon. The Clarke subdifferential (or 
generalized gradient) and the limiting subdifferential of a function are the earliest and most widely used 
constructions of the subject. A key distinction between these two notions is that, in contrast to the 
limiting subdifferential, the Clarke subdifferential is always convex. From a computational point of view, 
convexity of the Clarke subdifferential is a great virtue. To illustrate, by the classical Rademacher 
theorem, a locally Lipschitz continuous function / on an open subset U of R" is differentiable almost 
everywhere on U, in the sense of Lebesgue measure. Clarke, in [5], showed that for such functions, the 
Clarke subdifferential admits the simple presentation 

d c f(x) = conv{ lim V/(a; i ) : x t x}, (LipR) 

i— foo 

where x is any point of U and f2 is any full measure subset of U. Such a formula holds great computational 
promise since gradients are often cheap to compute. For example, utilizing ( |LipR[ ), Burke, Lewis, and 
Overton developed an effective computational scheme for approximating the Clarke subdifferential by 
sampling gradients [3], and, motivated by this idea, developed a robust optimization algorithm pQ. 

The authors of [3] further extended Clarke's result to the class of finite-valued, continuous functions 
/: U > R, defined on an open subset U of R ra , that are absolutely continuous on lines, and are direction- 
ally Lipschitzian; the latter means that the Clarke normal cone to the epigraph of / is pointed. Under 
these assumptions on /, the authors derived the representation 



dcf(x) = p| dconv(v/(nnB,(z))Y (ACLR) 
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where Bg(x) is an open ball of radius S around x and f2 is any full measure subset of U, and they 
extended their computational scheme to this more general setting. One can easily see that this formula 
generalizes Clarke's result, since locally Lipschitz functions are absolutely continuous on lines, and for 
such functions (|ACLR[) reduces to ( |LipR[ ) . Pointedness of the Clarke normal cone is a common theoretical 
assumption. For instance, closed convex sets with nonempty interior have this property. Some results 
related to (IACLR[) appear in [T5] . 

In optimization theory, one is often interested in extended real-valued functions (functions that are 
allowed to take on the value +00), so as to model constraints, for instance. Results above are not 
applicable in such instances. An early predecessor of ( |LipR[ ) and (jACLRp does rectify this problem, 
at least when convexity is present. Rockafellar [27] Theorem 25.6] showed that for any closed convex 
function /: R™ — > RU {+00}, whose domain dom/ has a nonempty interior, the convex subdiffcrential 
has the form 

df(x) = conv{ lim V/(xj) : x t ->• x} + N domf (x), (CoR) 

i— >oo 

where x is any point in the domain of / and N dom f(x) is the normal cone to the domain of / at x. 

Our goal is to provide an intuitive and geometric proof of a representation formula unifying ( |LipR[ ), 
(lACLRp . and ([CoRp . To do so, we will impose a certain structural assumption on the functions / that we 
consider. Namely, we will assume that the domain of / can be locally "stratified" into a finite collection 
of smooth manifolds, so that / is smooth on each such manifold. Many functions of practical importance 
in optimization and in nonsmooth analysis possess this property. All semi-algebraic functions (those 
functions whose graphs can be described as a union of finitely many sets, each defined by finitely many 
polynomial inequalities), and more generally, tame functions fall within this class |18j . We will show 
(Theorem I3.10[) that for a directionally Lipschitzian, stratifiable function /: R™ — > R U {+00}, that is 
continuous on its domain (for simplicity), the Clarke subdifferential admits the intuitive form 

d c f{x) = conv{ lim V/(xi) :i, + coco{ lim UVf(xi) : Xi — > x} + N^ om f (x), (1) 

tiXB 

or equivalently, 

d c f(x) = p| clconv(v/(fin B 5 (x))) +7V d c om/ (x), 

<5>0 

where f2 is any dense subset of dom / and coco denotes the convex conical hull. (In contrast to the 
aforementioned results, we do not require f2 to have full-measure). 

This is significant both from theoretical and computational perspectives. Proofs of ( |LipR[ ) and (|ACLR| 
are based largely on Fubini's theorem and analysis of directional derivatives, and though the arguments 
are elegant, they do not shed light on the geometry driving such representations to hold. Similarly, Rock- 
afellar's argument of (jCoRj) relies heavily on the well-oiled machinery of convex analysis. Consequently, a 
simple unified geometric argument is extremely desirable. From a practical point of view, representation 
(JT|) decouples the behavior of the function from the geometry of the domain; consequently, when the 
domain is a simple set (polyhedral perhaps) and the behavior of the function on the interior of the do- 
main is complex, our result provides a convenient method of calculating the Clarke subdiffcrential purely 
in terms of limits of gradients and the normal cone to the domain — information that is often readily 
available. Furthermore, using ((T|), the functions we consider in the current paper become amenable to 
the techniques developed in [3]. 

Whereas ((TJ) deals with pointwise estimation of the Clarke subdifferential, our second result addresses 
the geometry of subdifferential graphs, as a whole. In particular, we consider the size of subdifferential 
graphs, a feature that may have important algorithmic applications. For instance, Robinson |26 ( I25 ) shows 
computational promise for functions defined on R" whose subdifferential graphs are locally homeomor- 
phic to an open subset of R". Due to the results of Minty [20] and Poliquin- Rockafellar [24], Robinson's 
techniques are applicable for convex, and more generally, for "prox-regular" functions. Trying to un- 
derstand the size of subdifferential graphs in the absence of convexity (or monotonicity) , the authors in 
[TU] were led to consider the semi- algebraic setting. The authors proved that the limiting subdifferential 
graph of a closed, proper, semi- algebraic function on R" has uniform local dimension n. Applications 
to sensitivity analysis were also discussed. We show how the techniques developed in the current paper 
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drastically simplify the proof of this striking fact. Remarkably, this dimensional uniformity does not hold 
for the Clarke subdifferential graph. 

The rest of the paper is organized as follows. In Section [3J we establish notation and recall some 
basic facts from variational analysis. In Section [31 we derive a characterization formula for the Clarke 
subdifferential of a directionally Lipschitzian, stratifiable function that possesses a certain continuity 
property on its domain, and in Section 2] we relate our results to the gradient sampling framework. In 
Section [5j we prove the theorem concerning the local dimension of semi-algebraic subdifferential graphs. 
We have designed this last section to be entirely independent from the previous ones (except for Section[2J) , 
since it does require a short foray into semi-algebraic geometry. 

2 Preliminary results. 

In this section, we summarize some of the fundamental tools used in variational analysis and nonsmooth 
optimization. We refer the reader to the monographs Borwein-Zhu [2J, Clarke-Ledyaev-Stern-Wolcnski 
[5J, Mordukhovich [2TH22J, and Rockafellar-Wets for more details. Unless otherwise stated, we follow 
the terminology and notation of |29| . 

The functions we consider will take their values in the extended real line R := RU{— oo,oo}. We say 
that an extended-real- valued function is proper if it is never — oo and is not always +00. For a function 
/ : R n — » R, the domain of / is 

dom/ := {x <E R™ : f(x) < +00}, 

and the epigraph of / is 

epi/ := {(x,r) e R n x R : r > f(x)}. 

Throughout this work, we will only use Euclidean norms. Hence for a point x € R", the symbol \x\ 
will denote the standard Euclidean norm of x. Unless we state otherwise, the topology on R™ that we 

consider is induced by this norm. Given a set Q C R™ , the notation Xi% x will mean that the sequence 
xt converges to x and all the points Xi lie in Q. We let "o(\x — x\) for x £ Q" be shorthand for a function 

that satisfies ^fezffl — > whenever x % x with x 7^ x. 

A function /: R™ — > R is locally Lipschitz continuous at a point x relative to a set Q C R™ containing 
5, if f(x) is finite and there exists a a real number n G [0, +00) with 

\f(x) — f(y)\ < k\x — y\, for all x,y € Q near x. 

If there exists an open neighborhood Q of x so that the above conditions hold, then we simply say that 
/ is locally Lipschitz continuous at x. 

A set-valued mapping F from R™ to R m , denoted by F: R" =4 R"\ is a mapping from R" to the 
power set of R m . Hence for each point x 6 R", F(x) is a subset of R m . For a set-valued mapping 
F: R" R m , the domain of F is 

domF :={xeR n :F(x)^<b}, 

and the graph of F is 

gphF := {(x, y) e R" x R m : y e F(x)}. 

The outer limit of F at x is 

LimsupF(x) := {v € R m : 3xi — > x, 3v.i — !• v with Vi G F(xi)}. 

x^rx 

The mapping F is locally bounded near x if the image F(V) C R m is bounded, for some neighborhood 
V of x. The following definition extends in two ways the classical notion of continuity to set-valued 
mappings. 

Definition 2.1 (Continuity). Consider a set-valued mapping F: R" =4 R m - 
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1. F is outer semicontinuous at a point x £ R™ if for any sequence of points Xi € R™ converging to x 
and any sequence of vectors Vi € F(xi) converging to v, we must have v g -F(x). 

2. F is inner semicontinuous at x if for any sequence of points Xi converging to x and any vector 
v € F(x), there exist vectors g F(xj) converging to u. 

If both properties hold, then we say that _F is continuous at 5. 

We let intQ, clQ, convQ, and cocoQ, denote the interior, closure, convex hull, and convex conical 
hull of a set Q, respectively. A cone Q is said to be pointed if it contains no lines. An open ball of 
radius r around a point x g R™ will be denoted by B r (x). We let B and B be the open and closed unit 
balls, respectively. The following is a standard result on preservation of continuity of set-valued mappings 
under a pointwise convex conical hull operation. We provide a proof for completeness. 

Lemma 2.2 (Preservation of Continuity). Consider a set-valued mapping F: R" =4 R m that is outer- 
semicontinuous at a point x g R™. Suppose F is locally bounded near x and ^ F(x). Then the 
mapping 

x i ^ G{x) := cocoF(x) 
is outer- semicontinuous at x, provided that G(x) is pointed. 

Proof. Consider a sequence (x{,Vi) — > (x,v), with i>, g G{xi) for each index i. By Caratheodory's 
theorem, we deduce 

m 

i=i 

for some multipliers A*- > and vectors j/j g F(xi), where j = 1, . . . , m. 

Restricting to a subsequence, we may assume that there exist nonzero vectors yj g F(x) satisfying 

«j = lim , for each index j. 

We claim that the sequence of multipliers A*- is bounded. Indeed suppose this is not the case and let 
rrii := niax^ A*. Then up to a subsequence, there exist multipliers Xj, not all zero, such that 

A* m 

— — > Xj as i — > oo, and = ^ Xjyj, 

^ 3 = 1 

contradicting the fact that G(x) is pointed. We conclude that the multipliers A*- are bounded. 
Then up to a subsequence, we have 



v = lim V X)y) = V A, % g G(x), 



i=i j=i 

for some real numbers Aj > 0. We conclude that G is outer-semicontinuous at x. □ 
The distance of a point i to a set Q is 

d Q (x) := inf jar — 2/|, 
yeQ 

and the projection of x onto Q is 

P Q (x) := {y g Q : \x - y\ = d Q {x)}. 
We now consider normal cones, which are fundamental objects in variational geometry. 

Definition 2.3 (Proximal normals). Consider a set Q C R™ and a point x g Q. The proximal normal 
cone to Q at x, denoted Nq(x), consists of all vectors v g R™ such that x G Pq(x + -v) for some r > 0. 
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Geometrically, a vector v ^ is a proximal normal to Q at x precisely when there exists a ball 
touching Q at a; such that i; points from S towards the center of the ball. Furthermore, this condition 
amounts to 

(v, x — x) < 0(\x — x\ 2 ) as x — > x in Q. 
Relaxing the inequality above, one obtains the following notion. 

Definition 2.4 (Frechet normals). Consider a set Q C R™ and a point x g Q. The Frechet normal cone 
to Q at x, denoted Nq(x), consists of all vectors v £ R™ such that 

(v, x — x) < o(\x — x\) as x — > x in Q. 

Note that both Nq(x) and Nq(x) are convex cones, while Nq(x) is also closed. Evidently, the set- 
valued mapping x i— > Nq(x) is generally not outer-semicontinuous, and hence is not robust relative to 
perturbations in x. To correct for that, the following definition is introduced. 

Definition 2.5 (Limiting normals). Consider a set Q C R™ and a point The limiting normal 

cone to Q at 5, denoted Nq(x), consists of all vectors v € R™ such that there are sequences Xi % x and 
v i — > v with Vi £ NQ^Xi). 

The limiting normal cone, as defined above, consists of limits of Frechet normals. In fact, the same 
object arises if we only take limits of proximal normals |29l Exercise 6.18]. Convexifying the limiting 
normal cone leads to the following definition. 

Definition 2.6 (Clarke normals). Consider a set Q C R™ and a point x £ Q. The Clarke normal cone 
to Q at x is 

N^(x) := clconv N Q (x). 

Given any set Q C R" and a mapping F: Q —> Q, where Q C R m , we say that F is C p - smooth 
(p > 2) if for each point x £ Q, there is a neighborhood U of x and a C p mapping F : R™ — > R m that 
agrees with F on Q D U. Henceforth, to simplify notation, the word smooth will mean C 2 -smooth. 

Definition 2.7 (Smooth Manifolds). We say that a set M in R" is a C 2 -submanifold of dimension r if 
for each point x £ M, there is an open neighborhood U around x and a mapping F : R™ — > R"~ r that 
is C 2 -smooth with VF(i) of full rank, satisfying M n U — {x £ U : F(x) = 0}. 

A good reference on smooth manifold theory is |19j . 

Theorem 2.8. JM Example 6.8] Consider a C 2 -manifold M C R". Then at every point x £ M , the 
normal cone Nm(x) is equal to the normal space to M at x, in the sense of differential geometry. 

In fact, the following stronger characterization holds. 

Theorem 2.9 (Prox-normal neighborhood). Consider a C 2 -manifold M C R™ and a point x £ M . 
Then there exists an open neighborhood U of x, such that 

1. the projection map Pm is single-valued on U , 

2. for any two points x £ M C\U and v £ U , the equivalence, 

v £ x + N M {x) O x = P M {v), 

holds. 

Following the notation of [T3] , we call the set U that is guaranteed to exist by Theorem 12.91 a 
prox-normal neighborhood of M at x. For more details about Theorem 12.91 see [29 s Exercise 13.38], 6 ( 
Proposition 1.9]. We should note that the theorem above holds for all "prox-regular" sets M [24] . 

Armed with the aforementioned facts from variational geometry, we can study variational properties 
of functions via their subdiffcrcntial mappings. 
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Definition 2.10 (Subdiffcrentials). Consider a function /: R" — > R and a point x € R n , with f(x) 
finite. The limiting subdifferential of / at x is defined by 

df(x) = {v e R" : (v,-l) e iV cpi/ (.T,/(x))}. 

Proximal, Frechet, and Clarke subdifferentials are defined analogously. 

For x such that f(x) is not finite, we follow the convention that dpf(x) — df(x) — df(x) — d c f(x) = 0. 

The subdiffcrentials defined above fail to capture the horizontal normals to the epigraph. Hence to 
obtain a more complete picture, we consider the following. 

Definition 2.11 (Horizon subdifferential). For a function /: R" — > R that is finite at a point x, the 
horizon subdifferential is given by 

d°°f(x) R" : (v, 0) € N cpi f (x, f(x))}. 

For a set Q C R n , we define Sq : R" — > R to be a function that is on Q and +oo elsewhere. We call 
Sq the indicator function of Q. Then for a point 5, we have Nq(x) = dSQ(x), with analogous statements 
holding for the other subdifferentials. 

Often, we will work with discontinuous functions /: R™ — > R. For such functions, it is useful to 
consider / -attentive convergence of a sequence Xi to a point x, denoted Xi x. In this notation we have 

Xi — > x <^=4> Xi x and f(xi) f(x). 

If in addition we have a set Q C R", then X{ x will mean that Xi converges /-attentively to x and 

the points Xi all lie in Q. It is immediate that the mappings df and d°° f are outer-semicontinuous with 
respect to /-attentive convergence x — > x. 

Consider a function /: R" — > R that is locally lower semi-continuous at a point x, with f(x) finite. 
Then / is locally Lipschitz continuous around x if and only if the horizon subdifferential is trivial, that 
is the condition d°°f(x) = {0} holds [29l Theorem 9.13]. Weakening the latter condition to requiring 
d°° f(x) to simply be pointed, we arrive at the following central notion [29, Exercise 9.42]. 

Definition 2.12 (epi-Lipschitzian sets and directionally Lipschitzian functions). 

1. A set Q C R" is epi-Lipschitzian at one of its points x, if Q is locally closed at x and the normal 
cone Nq(x) is pointed. 

2. A function /: R" — ¥ R, that is finite at x, is directionally Lipschitzian at x if / is locally lower- 
semicontinuous at x and the cone d°° f(x) is pointed. 

Rockafellar [28j Section 4] proved that an epi-Lipschitzian set in R™, up to a rotation, locally coincides 
with an epigraph of a Lipschitz continuous function defined on R n_1 . We should further note that the 
Clarke normal cone mapping of an epi-Lipschitzian set is outer-semicontinuous [SJ Proposition 6.8]. 

It is easy to see that a function / : R™ — > R is directionally Lipschitzian at x if and only if the epigraph 
epi/ is epi-Lipschitzian at {x,f{x)). Furthermore, for a set Q that is locally closed at x, the limiting 
normal cone Nq(x) is pointed if and only if the Clarke normal cone Nq(x) is pointed [29l Exercise 9.42]. 

Consider the two functions 



fi(x) = x and f 2 {x) 



x if x < 
x + 1 if x > 



defined on the real line. Clearly both /i and f 2 are directionally Lipschitzian, and have the same deriva- 
tives at each point of differentiability. However <9 c /i(0) =^ <9 c /2(0). Roughly speaking, this situation arises 
because some normal cones to the epigraph of a function /, namely at points (x, r) with r > /(x), may 
not correspond to any subdifferential. Consequently, if we have any hope of deriving a characterization 
of the Clarke subdifferential purely in terms of gradients and the normal cone to the domain, we must 
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eliminate the situation above. Evidently, assumption of continuity of the function on the domain would 
do the trick. However, such an assumption would immediately eliminate many interesting convex func- 
tions from consideration. Rather than doing so, we identify a new condition, which arises naturally as a 
byproduct of our arguments. At the risk of sounding extravagant, we give this property a name. 

Definition 2.13 (Vertical continuity). We say that a function /: R" — > R is vertically continuous at a 
point x € dom / if the equation 

Limsup N cpif (x, r) = N domf (x) x {0}, (2) 

r>f(x) 

holds. 

To put this condition in perspective, we record the following observations. 

Proposition 2.14 (Properties of vertically continuous functions). Consider a proper function f: R" — > 
R that is locally lower- semicontinuous at a point x, with f(x) finite. 

1. Suppose that whenever a pair [x,r) £ epi/, with r > fix), is near (x,f(x)) we have 

N cpi f(x,r) = N domf (x) x {0}. 
Then f is vertically continuous at x. 

2. Suppose that x lies in the interior of dom / and that f is vertically continuous at x. Then f is 
continuous at x, in the usual sense. 

3. Suppose that f is continuous on a neighborhood of x, relative to the domain of f . Then f is 
vertically continuous at all points of dom / near x. 

4- If f is convex, then f is vertically continuous at every point x in dom/. 

5. Suppose that f is "amenable" at x in the sense of \23}j : that is, f is finite at x and there exists a 
neighborhood V of x so that f can be written as a composition f — goF, for a C 1 mapping F : V — > 
R m and a proper, lower- semicontinuous, convex function g : R m — > R, so that the qualification 
condition 

N domg (F(x)) n ker VF(i)* = {0}, (3) 
is satisfied. Then f is vertically continuous at x. 

Proof. Claim Q] is immediate from outer-semicontinuity of the normal cone map N dom t . 

To see [2J suppose that / is vertically continuous at x, £ intdom/. Since the normal cone to the 
domain of / at x consists of the zero vector, we deduce -/V pi f(x, f(x) + -) = {0}, for all sufficiently large 
integers n. By Exercise 6.19], we deduce that each such point (x, f(x) + —) lies in the interior of the 
epigraph epi/. Consequently for any sequence Xi — > x, we have 

limsup f(xt) <f(x) + ~, 

i— yoo ^ 

for all large indices n. Letting n tend to infinity, we deduce that / is upper-semicontinuous at x. The 
result follows. 

To see|3l suppose that / is continuous at x € dom/, relative to the domain of /. Then for any real 
r > f(x) there exists an e > so that the epigraph epi / coincides with the product set, dom / x [r—e, r+e] , 
locally around (x, r). In fact, this follows just from upper-semicontinuity of / at x, relative to dom/. We 
deduce A^ cpi f(x,r) = N dom j(x) x {0}. The result follows by[T] 

To see[H consider a pair {x,r) with r > f(x) and observe 

(v,a) G N cpif (x,r) <^=^ ((«,«), {x,r)) > ({v,a), (x',r')) for all (x',r') € epi/ 
<^=> a = and v € N dom f(x). 
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Appealing to [I] we obtain the result. 

The proof of [5] follows from a standard nonsmooth chain rule. We outline the argument below. 
Without loss of generality, we can assume that the representation / = g o F holds on all of R". Observe 

epi/ = {(x,r) GR" +1 :G(x,r) e epig}, 

for the mapping G : R n+1 — > R m+1 defined by G(x, r) = (F(x),r). We would like to use the chain 
rule appearing in |29[ Theorem 6.14] to compute the normal cone to epi/. To this end, consider a pair 
(x,r) € epi / and a vector (y,a) € N ep i g (G(x, r)). We have 

= VG(a, r)* {y, a) a = and VF(x)*y = 

JVdoma^a:)) and VF(a;)*y = 

where the last equivalence follows from the qualification condition ([3]). Applying the chain rule [291 
Theorem 6.14], we deduce 

A^ cpi f (x, r) = VG(x, r)*N cpig (G(x, r)), 
for all pairs (x,r) € epi/. In particular, if we have r > f(x), or equivalently r > g(F(x)), we deduce 

N epif (x,r) = V F(x)* N domg (F(x)) x {0}. 

The right hand side coincides with AT ( j OI11 /(ar) x {0} by i23J Theorem 3.3]. The result follows by appealing 
toQ] □ 

As can be seen from the proposition above, vertical continuity bridges the gap between continuity of 
the function on the interior of the domain and continuity on the whole domain, and hence the name. 
In summary, all convex and amenable functions have this property, as do functions that are continuous 
on their domains. An illustrative example is provided by the proper, lower semi-continuous, convex 
(directionally Lipschitzian) function / on R 2 , defined by 

y 2 /2x if x > 
f(x,y)={ ifx = 0,y = 

oo otherwise 

This function is discontinuous at the origin, despite being vertically continuous there. 
In the sequel, we will need the following basic result. 

Proposition 2.15. Consider a set M C R™ and a function f : R" —¥ R that is finite-valued and smooth 
on M . Then, at any point x £ M , we have 

df(x)cVg(x) + N M {x), 

where g: R™ — > R is any smooth function agreeing with f on M on a neighborhood of x. 

Proof. Define a function h : R" — > R agreeing with / on M and equalling plus infinity elsewhere. It is 
standard to check that the chain of inclusions, 

df(x) C Bh(x) = d(g(-) + 6 M (-))(x) c Vg(x) + N M (x), 

holds. □ 



3 Characterization of the Clarke Subdifferential. 

Directionally Lipschitzian functions play an important role in optimization and are close relatives of 
locally Lipschitz functions [3J 25]. Indeed, Rockafellar showed that epi-Lipschitzian sets, up to a change 
of coordinates, are epigraphs of Lipschitz functions [55J Section 4]. 
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As was mentioned in the introduction, a key feature of Clarke's construction is that the Clarke subdif- 
ferential of a locally Lipschitz function /, on R", can be described purely in terms of gradient information. 
It is then reasonable to hope that the same property holds for continuous directionally Lipschitzian func- 
tions, but this is too good to be true. Though such functions are differentiable almost everywhere [T], 
their gradients may fail to generate the entire Clarke sub-differential. A simple example is furnished by the 
classical ternary Cantor function — a nondecreasing, continuous, and therefore directionally Lipschitzian 
function, with zero derivative at each point of differentiability. The Clarke subdifferential of this function 
does not identically consist of the zero vector [2J Exercise 3.5.5], and consequently cannot be recovered 
from classical derivatives. This example notwithstanding, one does not expect the Cantor function to 
arise often in practice. 

Nonsmoothness arises naturally in many applications, but not pathologically so. On the contrary, 
nonsmoothness is usually highly structured. Often such structure manifests itself through existence of 
a stratification. In the current work, we consider so-called stratifiable functions. Roughly speaking, 
domains of such function can be decomposed into smooth manifolds (called strata), which fit together in 
a "regular" way, and so that the function is smooth on each such stratum. In particular, this rich class 
of functions includes all semi-algebraic, and more generally, all o-minimally defined functions. See for 
example |30j . We now make this notion precise. 

Definition 3.1 (Locally finite stratifications). Consider a set Q in R™. A locally finite stratification of 
Q is a partition of Q into disjoint manifolds Mj (called strata) satisfying 

• (frontier condition) for each index i, the the closure of Mj in Q is the union of some Mj's, and 

• (local finiteness) each point x £ Q has a neighborhood that intersects only finitely many strata. 

We say that a set Q C R™ is stratifiable if it admits a locally finite stratification. 

Observe that due to the frontier condition, a stratum Mi intersects the closure of another stratum 
Mj if and only if the inclusion M, C cl Mj holds. Consequently, given a locally finite stratification of a 
set Q into manifolds {Mi}, we can impose a natural partial order on the strata, namely 

Mi r< Mj o Mi C cl Mj. 

A good example to keep in mind is the partition of a convex polyhedron into its open faces. 

Definition 3.2 (Stratifiable functions). A function /: R" —> R is stratifiable if there exists a locally 
finite stratification of dom / so that / is smooth on each stratum. 

The following result nicely illustrates the geometric insight one obtains by working with stratifications 
explicitly. 

Proposition 3.3 (Dense differentiability). Consider a proper stratifiable function f : R" — > R that is 
directionally Lipschitzian at all points of dom / near x, and let Q be any dense subset of dom f . Then 
the set Q n domV/ is dense in the domain of f, in the f -attentive sense, locally near x. 

Proof. Consider a locally finite stratification of dom / into manifolds Mi so that / is smooth on each 
stratum. Suppose for the sake of contradiction that there exists a point x £ dom / arbitrarily close to x 
and an /-attentive neighborhood V = {y £ R" : |j/ — rc] < e, \ f(y) — f(%)\ < $} so that V D fi does not 
intersect any strata of dimension n. Shrinking V, we may assume that V intersects only finitely many 
strata, say {Mj} for j € J := {1, . . . , k}, and that the inclusion x £ clMj holds for each index j £ J. 
Notice that since / is continuous on each stratum, the set V is a union of open subsets of the strata Mj 
for j £ J. 

Now among the strata Mj with j £ J, choose a stratum M that is maximal with respect to the partial 
order <. Clearly, we have 

M n cl Mj = 0, for each j £ J with Mj ^ M. 

Now let y be any point of V D M and observe that there exists a neighborhood Y of y so that the 
functions / and / + 5m coincide on Y n M. We deduce that df(y) is a nontrivial affine subspace. Since 
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/ is directionally Lipschitzian at all points in dom/ near x, and in particular at y, we have arrived at 
a contradiction. Thus 51 n dom V/ is dense (in the /-attentive sense) in the domain of /, locally near 
x. □ 

In this section, we will derive a characterization formula for the Clarke subdifferential of a stratifiable, 
vertically continuous, directionally Lipschitzian function / : R™ — > R . This formula will only depend on 
the gradients of / and on the normal cone to the domain. It is important to note that the characterization 
formula, we obtain, is independent of any particular stratification of dom /; one only needs to know that 
/ is stratifiable in order to apply our result. The argument we present is entirely constructive and is 
motivated by the following fact. 

Proposition 3.4. Consider a closed, convex cone Q C R" , which is neither R" nor a half-space. Then 
the equality, 

Q = coco (bdQ). 

holds. 



Hence in light of Proposition 13.41 in order to obtain a representation formula for the Clarke subdif- 
ferential, it is sufficient to study the (relative) boundary structure of the Clarke normal cone. This is 
precisely the route we take. 

Lemma 3.5 (Frechet accessibility). Consider a closed set Q C R n , a C 2 -manifold M C Q, and a point 
x G M. Recall that the inclusion Nq(x) C Nm(x) holds. Suppose that a vector v € Nq(x) lies in the 
boundary of Nq{x), relative to the linear space Nm{x). Then there exists a sequence (xi,Vi) — > (x,v), 
with Vi £ Ng(xi), and so that all points X{ lie outside of M . 

Proof. Choose a vector w £ Nm{%) in such a way so as to guarantee 

v + tw<£N Q (x), for alii > 0. 

Consider the vectors 

y(t) := x + t(v + tw), (4) 
and observe y{t) £ x + Nq(x) for every t > 0. Consider a selection of the projection operator, 

x(t) e P Q (y(t)). 

Clearly, y(t) — > x and x(t) — > x, as t — > 0. Observe 

x{t) ± x, (5) 

for every i. 

We claim that the points x(t) all lie outside of M for all sufficiently small t > 0. Indeed, if this 
were not the case, then for sufficiently small i, the points xit) and y(t) would lie in the prox-normal 
neighborhood of M near x, and we would deduce 

x(t) = P M (y(t)) = x, 

contradicting ([5]). 

Thus all that is left is to show the convergence, y ^~ x ^ v. To this end, observe that from (Q}, we 
have 

1 ► v- (6) 



1 The idea to study the boundary structure of the Clarke normal cone in order to establish a convenient representation 
for the Clarke subdifferential is by no means new. For instance the same idea was used by Rockaffclar to establish the 
representation formula for the convex subdifferential |27l Theorem 25.6]. While working on this paper, the authors became 
aware that the same strategy was also used to prove a representation formula for the subdifferential of finite- valued, 
continuous functions whose epigraph has positive reach 1 1 21 Theorem 4.9], 1131 Theorem 2], In particular, Proposition 13.41 
also appears as 13, Proposition 3], 
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Hence it suffices to argue x ^\ x 0. By definition of x(t), we have 

\y(t)-x\>\(y(t)-x) + (x-x(t))\- (?) 
Squaring and simplifying the inequality above, we obtain 



y(t) — x x(t) — x \ 1 x(t) — x 



t ' t 



t 



2 



(8) 



From ([5]) and ([7]), we deduce that the vectors x are bounded as t — > 0. Consider any limit point 
7 £ R™. Taking the limit in ([8]), we obtain 

(«,7)>^l7| 2 - (9) 

Since v is a Frechet normal, we deduce 

(v,x(t) - x) < o(\x(t) - x\). 

It immediately follows that 

(v,l)<0, 

and in light of (HJ), we obtain 7 = 0. Hence 

y(t) - x{t) _ 
t ' 

as we claimed. □ 

Remark 3.6. We note, in passing, that an analogue of Lemma 1X51 (with an identical proof) holds when 
M is simply "prox-regular" , in the sense of [24], around x. In particular, the lemma is valid when M is 
a convex set. 



The combination of Lemma 13.51 and Proposition 13.41 yields dividends even in the simplest case when 
the manifold M of Lemma l3~5l is a singleton set. We should emphasize that in the following proposition, 
we do not even assume that the function in question is directionally Lipschitzian or stratifiable. 

Proposition 3.7 (Isolated singularity). Consider a continuous function f: U — > R, defined on an open 
set U C R". Suppose that f is differentiable onU\ {x} for some point x £ U , and that df(x) ^ 0. Then 

d c f(x) = cl ( conv { lim V ' f{Xi) '■ x% — > x} + coco { lim i,-V ' f(Xi) : Xi — > x} ) , 

ti-l-0 

under the convention that conv0 = {0}. 
Proof. Define the two sets 

E := { lim V/(Xj) : Xi -> x], H := { lim i i V/(x i ) : x t -> x}, 

i— too i— >oo 

fij.0 

and consider the epigraph Q := epi/ and the singleton set M := {(x, f(x))}. 
By Lemma 13.51 and continuity of /, we have 

bdN Q (x,f(x)) C coco(£: x {-1}) U (H x {0}). (10) 

Case 1. Suppose Nq(x, f{x)) is not equal to R" x [0, —00). Then from Proposition EH] and pH]) . we 
deduce 

N c Q (x, f(x)) = clcoco (E x {-1}) U (H x {0}). (11) 
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From (jTTJ) , we see that an inclusion (v, — 1) € Nq(x, f(x)) holds if and only if for every e > 0, there 
exist vectors t/ s € £U i? , and real numbers A$ > 0, for 1 < i < n + 1, satisfying 

u - ( X XiVi + X XiVi ) < e ' 

1= £ A, 

Thus d c f(x) = cl(convi? + coco-ff), as we claimed. 

Case 2. Now suppose N Q (x,f(x)) — R™ x [0,-oo). Then from (10]), we deduce i? = R n and 
dcf(x) = R™ = convi? + cocoi/, under the convention conv0 = {0}. □ 

As an illustration, consider the following simple example. 



Example 3.8. Consider the function f(x,y) :— \/x 4 + y 2 on R 2 . Clearly / is differentiable on R 2 \ 
{(0,0)}. The gradient has the form 

1 fx 3 



(x 4 + y 2 )3/4 V \y 

From Proposition I3.7[ we obtain 

d c f(x) = cl (conv{ lim V/(xi) : Xi — > x} + coco{ lim fjV/(xj) : Xi — > x} 

tiiO 

Observe that the vectors V/(x, 0) are equal to (±1,0), and the vectors 2y / |y|V/(0, ±y) are equal to 
(0, ±1), whenever x =/= =/= y. Thus we obtain 

[-1, 1] x {0} C conv{ lim V f{xt) : Xi -» x}. 

2— >oo 

{0} x R C coco{ lim £,V f(xi) : x\ — > x}. 

i— >oo 
4,4,0 

Consequently, the inclusion 

[-1,1] x rc a c /(o,o) 

holds. The absolute value of the first coordinate of \7f(x,y) is always bounded by 1, which implies the 
reverse inclusion above. Thus we have exact equality d c f(0, 0) = [—1, 1] x R. 

We record the following observation for ease of reference. 

Corollary 3.9. Consider a closed, convex cone Q C R" with nonempty interior. Suppose that bdQ is 
contained in a proper linear subspace. Then Q is either all of R™ or a half-space. 

Proof. Clearly if Q were neither R™ or a half-space, then by Proposition 13.41 we would deduce that 
Q = coco (bd Q) has empty interior, which is a contradiction. □ 

Armed with Proposition 13.41 and Lemma 13.51 we can now prove the main result of this section with 
ease. 

Theorem 3.10 (Characterization). Consider a proper, stratifiable function f: R" — > R that is finite at 
x. Suppose that f is vertically continuous and directionally Lipschitzian at all points o/dom/ near x. 
Then for any dense subset C dom/, we have 

N^ if (xJ(x)) = coco { lim (W^ - 1 ) -. Xi ^x} + (N^ omf (x) x {0}). (12) 

«-"» y/l+ \Vf( Xi )\ f 

Consequently, the Clarke subdifferential admits the presentation 

9 c f{x) = conv { lim V/fzj) : a* — >• x] + coco { lim WVf(xi) : x { ^> x] + NS f (x). 

2— »oo f i— >od f 
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Proof. We first prove (fT2| . Observe that since / is vertically continuous at x, we have N^ om j(x) x {0} C 
N£ pi j:(x,f(x)), and hence the inclusion "D" holds. Therefore we must establish the reverse inclusion. 
To this effect, intersecting the domain of / with a small open ball around x, we may assume that / is 
directionally Lipschitzian and vertically continuous at each point x £ dom/. For notational convenience, 

for a vector v £ R", let " :— ^ . Define the set- valued mapping 

V 1+ M 2 

f (s) := coco f { lim V/(x J )" :i,^i}U (iV dom , (a) n B) x {0}) , (13) 



By Proposition 13. 3[ the set {lim^oo V f{xj) : Xi — > x} is nonempty. Furthermore, from the established 

inclusion "D", we see that iVdom/(a;) is pointed and hence the set cocoiVdom/C^) i s closed for all x £ 
dom/. Consequently, we deduce 

F(x) = coco { lim Vf(xi) : x t ^> x} + (N% om Ax) x {0}), 

i->oo f 

Combining (|13[) with Lemma 12 .21 we see that F is outer-semicontinuous with respect to /-attentive 
convergence. 

Now consider a stratification of dom / into manifolds {Mi} having the property that / is smooth on 
each stratum Mi. Restricting the domain of /, we may assume that the stratification {Mi} consists of 
only finitely many sets. We prove the theorem by induction on the dimension of the strata M, in which 
the point x lies. 

Clearly, the result holds for all strata of dimension n, since / is smooth on such strata and f2 is dense 
in dom/. As an inductive hypothesis, suppose that the claim holds for all strata that are strictly greater 
in the partial order ^ than a certain stratum M and let x be an arbitrary point of M. 

Since / is smooth on M, we deduce that gph /| is a smooth manifold. Then by Lemma 13.51 for every 

vector ^ v £ vbN ep if(x,f(x)), there exists a sequence (xi,ri,vi) — > (x, f(x), v), with vi £ N cp i f(xi,ri) 
and (xi,ri) gph/| M . Suppose that there exists a subsequence satisfying r; ^ f(%l) for each index I. 
Then since / is vertically continuous at x, we obtain 

v = lim in C LimsupiV e pi/(x;,n) C N dom f(x) x {0} C F(x). 

On the other hand, if n = f(xi) for all large indices i, then restricting to a subsequence, we may 
assume that all the points xi lie in a a stratum M' with M' y M. The inductive hypothesis and 
/-attentive outer-semicontinuity of F yield the inclusion v £ F(x). 

Thus we have established the inclusion, 

rbJVepi/(x,/(s)) C F(x). 

Since d°°f(x) is pointed, we deduce that the cone iV e pi f(x) is neither a linear subspace nor a half-subspace. 
Consequently by Lemma HOI we deduce 

N cpif {xJ{x))=cocoTbN cpif {xJ(x)) C F(x). (14) 

In fact, we have shown that (fl4| holds for all points x £ M. 

Finally consider a limiting normal v £ N ep { f{x). Then there exists a sequence {xi, ri,vi) — > {x, f(x), v), 
with vi £ N e pi f( x h r i)- It follows from ()14p . the inductive hypothesis, and /-attentive outer-semicontinuity 
of F that the inclusion v £ F{x) holds. Thus the induction is complete, as is the proof of (fT2|) . 

To finish the proof of the theorem, define the two sets 

E := { lim V/(xj) : xi % x}, H := { lim WVf(xi) : x { A x}. (15) 

Observe 

coco { lim Vf{xi) : x, ^ x} = coco (E x {-1}) U (H X {0}) . 
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Thus an inclusion (v, — 1) € Nq(x, f(x)) holds if and only if there exist vectors yi 6 EU H and 
y G N£ om f(x), and real numbers A$ > 0, for 1 < i < n. + 1, satisfying 

The result follows. □ 

Recovering representation (IACLR[) of the introduction, in the setting of stratifiable functions, is now 
an easy task. 

Corollary 3.11. Consider a proper, stratifiable function f: R™ — > R, that is finite at x. Suppose that f 
is directionally Lipschitzian at all points o/dom / near x, and is continuous near x relative to the domain 
of f. Then we have 

dcf(x) = p| clconv (v/(n n B s (x))) + iV d c om/ (x), 

<5>0 

where Q is any dense subset o/dom/. 

Proof. Since the cone N£ pi j:(x, f(x)) is pointed, one can easily verify, much along the lines of Lemma [2T2l 
that the equation 

Pi ci C oco { J^IMl=£L :xe nn B s (x)\ -coco { lim ( V /(^),-l) 4 ^ 

holds. The result follows by an application of Theorem 13. 101 We leave the details to the reader. □ 

Our next goal is to recover the representation of the convex subdifferential (jCoRp of the introduction 
in the stratifiable setting. In fact, we will consider the more general case of amenable functions. Before 
proceeding, recall that a proper, lower-semicontinuous, convex function is directionally Lipschitzian at 
some point if and only if its domain has nonempty interior. A completely analogous situation occurs for 
amenable functions. 

Lemma 3.12. Consider a function f: R" — > R that is amenable at x. Let V be a neighborhood of x 
so that f can be written as a composition f = g o F, for a C 1 mapping F: V — > R m and a proper, 
lower-semicontinuous, convex function g : R m — > R, so that the qualification condition 

N domg (F{x)) n kerVF(x)* = {0}, 

is satisfied. Then there exists a neighborhood U of x so that 

1. F(U n int dom /) C int dom g, 

2. U C\ F" 1 (int dom g) C int dom/. 

Furthermore f is directionally Lipschitzian at x if and only if x lies in cl (int dom/). 

Proof. Let us first recall a few useful formulas. To this end, [531 Theorem 3.3] shows that there exists a 
neighborhood U of a; so that for all points ie(7n dom /, we have 

{0} = AW^i^WnkerVi^x)*, (16) 
df(x) = VF(x)*dg(F(x)), (17) 
N domf (x) = VF(x)*N domg (F(x)). (18) 

Furthermore a computation in the proof of Proposition 12.141 (item [5J shows that for any x € U PI dom / 

and any r > f(x), we have 

A^ cpi f (x, r) = VF(x)*N domg (F(x)) x {0}. (19) 
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Observe for any x € U f~l int dom /, we have 

= A dom/ (x) = VF(x)*N domg (F(x)), 

and consequently N domg (F(x)) = 0. We conclude F(x) € intdomg, thus establishing [T] 

Now consider a point x e U n F _1 (int dom g). Using (JT5J, we deduce N ep if(x,r) = for any 
r > f(x). Hence by [291 Exercise 6.19], we conclude (x, r) G intepi/ and consequently x € int dom/, 
thus establishing [2] 

By [29l Exercise 10.25 (a)], we have d°°f(x) = N dom f(x), and in light of (|16p and (|lgp one can readily 
verify that the cone N dom f(x) is pointed if and only if N domg (F(x)) is pointed. 

Now suppose that x lies in cl (int dom/). Then by [T] the domain dom g has nonempty interior and 
consequently N domg (F(x)) is pointed, as is iV<i om /(x). 

Conversely suppose that / is directionally Lipschitzian at x. Then N domg (F(x)) is pointed, and 
consequently dom g has nonempty interior. Observe since F is continuous, the set F" 1 (int doing) C dom / 
is open. Hence it is sufficient to argue that this set contains x in its closure. Suppose this is not the 
case. Then there exists a neighborhood U of x so that the image F(U) does not intersect int dom g. It 
follows that the range of the linearised mapping w F(x) + WF(x)w can be separated from dom g, thus 
contradicting (|16l) . See 29, Theorem 10.6] for a more detailed explanation of this latter assertion. □ 

We can now easily recover, in the stratifiable setting, representation (|CoR|) of the introduction. In 
fact, an entirely analogous formula holds more generally for amenable functions. 

Corollary 3.13. Consider a proper stratifiable function f : R" — > R, that is amenable at a point x, and 
so that x lies in the closure of the interior of dom /. Let Q be any dense subset of dom /. Then the 
subdifferential admits the presentation 

df(x) = conv { lim V/(x, i ) : x. L -4 x} + N domf (x). 

i— >oo 

Proof. By Exercise 10.25], we have d°°f(x) = N doTn f(x). Thus we have 

coco { lim UVflxi) : x t — > x} C N dom t (x). 

i^-oo f 
tij.0 

Observe / is amenable, directionally Lipschitzian (Lemma 13.121) , and vertically continuous (Proposi- 
tion at each point of dom / near x. Applying Theorem 13. 101 we deduce 

df(x) = conv { lim V/(xj) : x l % x] + iV dom/ (x). 

Noting that the subdifferential map df of an amenable function is outer-semicontinuous, the result 
follows. □ 

A natural question arises. Does the corollary above hold more generally without the stratifiability 
assumption? The answer turns out to be yes. This is immediate, in light of (jCoRp . for the subclass of 
lower-C 2 functions (those functions that are locally representable as a difference of convex functions and 
convex quadratics). A hrst attempt at a proof for general amenable functions might be to consider the 
representation / = g o F and the chain rule 

df(x)=WF(x)*dg(F(x)). 

One may then try to naively use Rockafellar's representation formula (|CoR| for the convex subdifferential 

dg{F(x)) = conv{ lim Vg{y t ) : y t -> F(x)} 

i— >oo 

to deduce the result. However, we immediately run into trouble since F may easily fail to be surjective 
onto a neighborhood of F(x) in dom g. Hence a different more sophisticated proof technique is required. 
For completeness, we present an argument below, which is a natural extension of the proof of |271 
Theorem 25.6]. It is furthermore instructive to emphasize how the stratifiability assumption allowed us 
in Corollary 13. 131 to bypass essentially all the technical details of the argument below. 
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Theorem 3.14. Consider a function /: R" — » R that is amenable at a point x lying in cl (int dom /). 
Then the subdifferential admits the presentation 

df(x) = conv{ lim V/(ar<) : x l % x] + N domf (x), 

where is any full-measure subset o/dom/. 

Proof. Recall that / is Clarke regular at x, and therefore d°° f(x) — N dom f(x) is the recession cone of 
df{x). Combining this with the fact that the map df is outer-semicontinuous at x, we immediately 
deduce the inclusion "d" . 

We now argue the reverse inclusion. To this end, let V be a neighborhood of x so that / can be 
written as a composition / = g o F, for a C 1 mapping F : V — » R m and a proper, lower-semicontinuous, 
convex function g : R™ 1 — > R, so that the qualification condition 

N domg (F(x)) n kerVF(x)* - {0}, 

is satisfied. Since / is dircctionally Lipschitzian at x, the subdifferential df(x) is the sum of the convex 
hull of its extreme points and the recession cone N dom / (x) ■ Furthermore every extreme point is a limit 
of exposed points. Thus 

df(x) = conv(cl£) +N d omf(x), 

where E is the set of all exposed point of df(x). 

Hence to prove the needed inclusion, it suffices to argue the inclusion 

E C conv { lim Vf(xi) : Xi — » x\. 

i— >oo 

Here, we should note that since / is directionally Lipschitzian at x, the set on the right hand side is 
closed. 

To this end, let v be an arbitrary exposed point of df(x). By definition, there exists a vector a £ R™ 
with \a\ = 1 and satisfying 

(a, v) > (a, v) for all v € df(x) with v ^ v. 
Since Nd om f(x) is the recession cone of df(x), from above we deduce 

(a, z) < for all ^ z e N dom f(x), 

and consequently 

(VF(x)o, w) < for all 7^ w G iVdom 9 (-F , (S)). 

Consider the half-line {F(x) + tWF(x)a : t > 0}. We claim that this half-line cannot be separated from 
domg. Indeed, otherwise there would exist a nonzero vector w £ N domg (x) so that for all t > and all 
x G dom<7 we have 

(x,w) < (F(x) +tVF(x)a,w) < (F(x),w), 

which is a contradiction. Hence by [27l Theorem 11.3], this half-line must meet the interior of dom g. By 
convexity then there exists a real number a > satisfying 

{F(x) + tVF(x)a : < t < a} C int (dom g). 

Consequently the points F(x + ta) lie in intdomg for all sufficiently small t > 0. By Lemma T3.12[ we 
deduce that there exists a real number (3 > so that 

{x + ta : < t < (3} C int (dom/). 

Hence / is Lipschitz continuous at each point x + ta (for < t < j3), and so from ( |LipR[ ) we obtain 

df(x + ta) — conv{ lim Vf(xj) : Xj ^4 x + ta}. (20) 
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Now choose a sequence ti — > and observe that by [37J Theorem 24.6], for any e > we have 

dg(F(x + t t a)) C argmax (VF(x)a, v) + eB, 

v€dg(F(x)) 

= argmax (a, WF(x)*v) + eB, 

v£dg(F(x)) 

for all large i. We deduce, 

\7F(x)*dg(F(x + f,-a)) C argmax(a, w) + eB = v + eB. 

Thus there exists a sequence Wi £ dg(F(x + Ua)) with VF(x)*Wi — > v. Consequently the vectors 
VF(i + tia)*Wi £ df(x + ti<i) converge to v. The result now easily follows from (|20|) and the fact that / 
is directionally Lipschitzian at x. □ 

The following is a further illustration of the applicability of our results to a wide variety of situations. 

Corollary 3.15. Consider a proper stratifiable function /: R™ — > R that is locally Lipschitz continuous 
at a point x, relative to dom/. Suppose furthermore that dom/ is an epi- Lipschitzian set at x. Then the 
formula 

df{x) = conv{ lim V/fo) : x, L -> x] + N% om f (x), 
holds, where VL is any dense subset of dom /. 

Proof. Since / is locally Lipschitz near x relative to dom/, there exists a globally Lipschitz function 
/: R" — > R, agreeing with / on dom / near x. Hence, we have 

f(x) — f(x) + 5d om f(x), locally near x. 

Combining this with [521 Exercise 10.10], we deduce 

d°° f{x) C N<i om f(x), for x near x. 

We conclude that / is directionally Lipschitzian at all points of dom / near x, and furthermore since 
the gradients of / are bounded near x so are the gradients of /. The result follows immediately by an 
application of Proposition 12 . 141 and Theorem 13. 101 □ 

4 Comments on random sampling 

Consider a proper, stratifiable function / : R" — > R, that is finite at x, directionally Lipschitzian at all 
points of dom / near x, and is continuous near x relative to the domain of /. Let us suppose for the time 
being that the normal cone Ndomf(x) is known exactly. Our interest lies in approximating the Clarke 
subdifferential d c f(x). In light of Corollary 13. 1 H one can quickly modify the method considered in [3], 
in order to achieve this goal. We now describe this procedure in more detail. 

Fix a certain radius S > and a sample space A = B$(x), along with an associated probability 
measure that is absolutely continuous with respect to the Lebesgue measure /j, on R™ . We assume that the 
corresponding density 9 is strictly positive almost everywhere on A. Observe that the set An dom V/ has 
nonempty interior, and consequently has positive probability. We can consider a sequence of independent 
trials with outcomes Xi £ A for z = 1,2,..., and form trial sets 

Ck = conv{V/(a; l ) : x t £ AfldomV/, 1 < i < k}. 

One may then hope that the sets D/, := Ck + Nd om Jx) approximate the Clarke subdifferential d c f(x) 
well, as k tends to infinity. One can verify this rigorously via some modifications of arguments made in 
[3]. The starting point is the following result. We omit the proof since it is identical to the argument in 
[3j Theorem 2.1]. 
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Theorem 4.1 (Limiting Approximation). Consider a function f: R" — > R and a point x £ dom/. 
Suppose that f is continuously differentiable on an open set dense in dom/. Then for any sampling 
radius S, we have 

oo 

cl Cfe = clconvV/ (Bg(x)), almost surely. 

k=l 

The following theorem establishes that cl UfeLi Dk is almost surely an outer approximation of d c f(x). 

Theorem 4.2 (Outer approximation). Consider a proper, stratifiable function f : R" —¥ R, that is finite 
at x. Suppose that f is directionally Lipschitzian at all points of dom / near x, and is continuous near x 
relative to the domain of f . Then 

oo 

d c f(x) C cl Dk, almost surely. 

k=l 

Proof. This is immediate from Theorem 14.11 and Corollary 13.111 □ 

On the other hand, the following lemma shows that cl (JfcLi Dk is not too much bigger than d c f(x), 
as long as the radius is sufficiently small and we restrict ourselves to considering only bounded subsets 
of R™. 

Lemma 4.3 (Truncated Inner approximation). Consider a proper, stratifiable function f : R" — > R, that 
is finite at x. Suppose that f is directionally Lipschitzian at all points of dom / near x, and is continuous 
near x relative to the domain of f . Then for any compact subset T C R™ and a real number e > we 
have, for any sufficiently small sampling radius, 

oo 

r n cl (J D k C r n d c f(x) + eB, almost surely. 

k=l 

If, in addition, f is Lipschitz continuous on its domain, then for any real number e > we have, for any 
sufficiently small sampling radius, 

oo 

cl (J Dk C d c f(x) + eB, almost surely. 

k=l 

Proof. Since / is directionally Lipschitzian at x, one can easily see check that the mapping d c f is outer- 
semicontinuous at x. Consequently the truncated mapping x i— > T n d c f(x) is upper-semicontinuous at 
x. We conclude that there exists a real number 5 > such that 

r n d c f(x + SB) c r n d c f(x) + eB. 

Observe that cl ljfc°=i Dk is almost surely contained in L n clconv<9 c /(x + 5B), and hence the result 
follows. 

Now suppose that / is Lipschitz continuous on its domain. Then, one can verify that that there exists 
a real number S > satisfying 

Vf(B s (x))cd c f(x)+eB. 
Since cl (Jfc=i Dk is almost surely contained clconv (Vf(B s (x)) + N domf (x)), the result follows. □ 

In particular, the distance of a fixed vector v to cl UfeLi Dk can be made arbitrarily close to the 
distance of v to d c f(x). 

Theorem 4.4 (Distance approximation). Consider a proper, stratifiable function f: R" — > R, that is 
finite at x. Suppose that f is directionally Lipschitzian at all points of dom / near x, and is continuous 
near x relative to the domain of f . Then for any vector v € R" and a real e > we have, for any 
sufficiently small sampling radius, 

oo 

dist (v, cl Dk) > dist (v, d c f(x)) — e, almost surely, 
fe=i 
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and consequently 

lim dist (v, clDk) — dist (v, d c f(x))\ < e, almost surely. 

k— >oo 

Proof. Let 7 := dist (i>, d c f(x)) and F = B 7 (v). Then by Lemma T4. 31 we have for any sufficiently small 
radius, 

00 

L n cl |J D k C r n d c f(x) + eB, almost surely. 

fc=i 

The result readily follows from the inclusion above and Theorem 14.21 □ 
A natural test for optimality, using the sampling scheme, is to determine whether the inclusion 

OeD k , 

holds. According to Theorem 14.21 if x is a Clarke critical point, then dist (0, D k ) — > almost surely. On 
the other hand, if x is not Clarke-critical, then by Theorem 14.41 for any sufficiently small radius, we have 
limfc^oo dist (0, clDk) > 0, and hence the test £ Dk will not generate a false positive. 

From a computational point of view, there is a certain difficulty we have not addressed. Suppose 
that for each radius <5, the trial points Xi are sampled with uniform distribution on the ball B$(x). One 
could worry that as the sampling radius decreases to zero, the proportion of points Xi discarded (those 
that lie outside of the domain) to those that are in the domain could be arbitrarily large, with positive 
probability. This, for instance, could happen if the domain was the set {{x,y) e R 2 : \y\ <x 2 ,0< x}. 
This pathology, however, does not occur in the directionally Lipschitzian setting. 

Proposition 4.5 (Domain of a continuous directionally Lipschitzian function). Consider a proper func- 
tion f: R™ — > R, that is finite at x. Suppose that f is directionally Lipschitzian at x and is continuous 
at x, relative to dom/. Then the domain of f is epi-Lipschitzian at x. 

Proof. By continuity of /, the domain of / is locally closed near x. Now observe that for each real number 
r satisfying r > f(x), we have iV epi j(x, r) — N dom f(x) x {0}. Consequently the inclusion 

N domf (x) x {0} C N cpi f {x, f(x)), 

holds. We conclude that the normal cone N doirL f(x) is pointed, and hence the domain of / is an epi- 
Lipschitzian set at x. □ 

The lower Lebesgue density of a set Q C R™ at a point x is 

Dens Q, x) := hmmf , 

.5^0 [i(Bs(x)) 

where we recall that fi denotes the Lebesgue measure on R™. 

Proposition 4.6 (Density of epi-Lipschitzian sets). Consider a set Q C R n that is epi-Lipschitzian at 
a point x. Then we have 

Dens~(Q,x) > 0. 

Proof. Since Q is epi-Lipschitzian at x, we may assume that Q is an epigraph of a Lipschitz continuous 
function /: R n_1 — > R, with x = (0,0) (See for example [1H1 Section 4]). We deduce f(x) < n\x\, for 
some constant n > and for all points in x £ R n_1 . Consequently, epi/ contains a convex cone with 
nonempty interior, a set that has strictly positive Lebesgue density. The result follows. □ 

The sampling scheme outlined in this section is effective whenever gradients are cheap to compute 
and the normal cone to the domain at the point of interest is known in advance. Now suppose that 
the normal cone is unknown to us, but nevertheless we can test whether a point is in the domain and 
we can project onto the domain easily. Then a slight modification of the sampling scheme can still be 
effective at approximating the Clarke subdifferential. Namely during our sampling scheme, rather than 
discarding points lying outside of the domain, we may use these points to approximate the normal cone 
by projecting onto the domain. See |15j for more details. Using this information along with the sampled 
gradients, we may still hope to approximate the whole Clarke subdifferential effectively. We, however, do 
not pursue this further in our work. 
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5 Local Dimension of semi-algebraic sub differential graphs. 



In this section, we apply our methods to study the size of subdifferential graphs of semi-algebraic functions. 
We first establish notation and record some preliminary facts from semi-algebraic geometry. 

5.1 Semi-algebraic Geometry. 

A semi- algebraic set S C R™ is a finite union of sets of the form 

{xeR" :Pi(x) = 0,...,P k {x)=0,Q 1 {x) <0,...,Qi(x) < 0}, 

where Pi, . . . , Pk and Qi, ■ ■ ■ ,Qi are polynomials in n variables. In other words, S is a union of finitely 
many sets, each defined by finitely many polynomial equalities and inequalities. A map F: R™ R m is 
said to be semi- algebraic if gphF C R" +m is a semi- algebraic set. Semi- algebraic sets enjoy many nice 
structural properties. Unless otherwise stated, we follow the notation of [5] and [3D]. 

A fundamental fact about semi-algebraic sets is provided by the Tarski-Seidenberg Theorem [HI The- 
orem 2.3]. Roughly speaking, it states that a linear projection of a semi-algebraic set remains semi- 
algebraic. From this result, it follows that a great many constructions preserve semi-algebraicity. In 
particular, for a semi-algebraic function /: R" —> R, the set-valued mappings dpf, df, df, and d c f are 
semi- algebraic. See for example [18[ Proposition 3.1]. 

Definition 5.1 (Compatibility). Given finite collections {Bi] and {Cj} of subsets of R™, we say that 
{Bi} is compatible with {Cj} if for all Bi and Cj, either Bi n Cj ; = or Bi C Cj. 

Definition 5.2 (Stratifications). Consider a set Q in R™. A stratification of Q is a finite partition of 
Q into disjoint, connected, manifolds Mi (called strata) with the property that for each index i, the 
intersection of the closure of Mi with Q is the union of some Mj's. 

Remarkably, semi-algebraic sets always admit stratifications. Indeed, the following stronger result 
holds. 

Proposition 5.3. \3(A Theorem 4-8] Consider a semi- algebraic function f : R™ — > R. Then there exists 
a stratification A o/dom/ so that f is smooth on each stratum Mi 6 A. Furthermore, if B is some other 
stratification o/dom/, then we can ensure that A is compatible with B. 

Definition 5.4 (Dimension). Let Q C R™ be a nonempty semi- algebraic set. Then the dimension of 
Q, denoted dimQ, is the maximal dimension of a stratum in any stratification of Q. We adopt the 
convention that dim0 = — oo. 

It can be easily shown that the dimension does not depend on the particular stratification. Observe 
that the dimension of a semi-algebraic set only depends on the maximal dimensional manifold in a 
stratification. Hence, dimension is a crude measure of the size of the semi- algebraic set. This motivates 
a localized notion of dimension. 

Definition 5.5 (Local dimension). Consider a semi-algebraic set Q C R" and a point 5 £ Q. We let 
the local dimension of Q at x be 

dimQ(x) := inf dim(Q fl B e (x)). 

It is not difficult to see that there exists a real number e > such that for every real number < e < e, 
we have diniQ^) = dim(Q fl B e (x)). 

There is a straightforward connection between local dimension and dimension of strata. This is the 
content of the following proposition. 

Proposition 5.6. \1(A Proposition 3.4] Consider a semi- algebraic set Q C R™ and a point x G Q. Let 
{Mi} be any stratification of Q. Then we have the identity 

dimg(a;) = maxjdimMi : x € c\Mi}. 
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Definition 5.7 (Maximal strata). Given a stratification {Mi} of a semi-algebraic set Q C R™, a stratum 
M is maximal if it is not contained in the closure of any other stratum. 

Using the defining property of a stratification, we can equivalently say that given a stratification {Mi} 
of a semi-algebraic set Q C R™, a stratum M,; is maximal if and only if it is disjoint from the closure of 
any other stratum. 

Proposition 5.8. \1IA Proposition 3.7] Consider a stratification {Mi} of a semi-algebraic set Q C R". 
Then given any point x G Q, there exists a maximal stratum M satisfying x G cl M and dim M = dimQ (x) . 

Semi-algebraic methods have recently found great uses in set-valued analysis. See for example HU 
[TBI EH HZ] • A fact that will be particularly useful for us is that semi- algebraic set- valued mappings are 
"generically " inner-semicontinuous . 

Proposition 5.9. Jill Proposition 2.28, 2.30] Consider a semi- algebraic, set-valued mapping G: R™ 
R m . Then there exists a stratification o/domG into finitely many semi-algebraic manifolds {Mi} such 
that on each stratum Mi, the mapping G is inner-semicontinuous and the dimension of the images F{x) 
is constant. If in addition F is closed-valued, then we can ensure that the restriction G\ M is also outer- 
semicontinuous for each index i. 

For a more refined result along the lines of Proposition l5.9[ see [HI Theorem 28]. The following result 
is standard. 

Proposition 5.10. |7J Theorem 3.18] Consider a semi- algebraic, set-valued mapping F: R" =4 R m . 
Suppose there exists an integer k such that F(x) is k- dimensional for each point x G domF. Then the 
equality, 

dim gph i* 1 = dimdomi 7, + k, 

holds. 

We will need a version of Proposition 15 . 1(71 that pertains to local dimension. 

Proposition 5.11. Consider a semialgebraic mapping F: R" =4 R m that is inner-semicontinuous on 
its domain. Suppose that there exist constants k and I such that for each pair (x,v) G gphi* 1 , we have 

dimdomF x = k, diniF( x ) v = I. 

Then gph_F has local dimension k + I around every pair (x,v) G gphF. 

Proof. Let it: R" x R m — > R™ be the canonical projection onto R™. Consider any stratification A of 
gph F and let M G A be any maximal stratum. Clearly 

dim M < dim gph F < k + I. 

Consider an arbitrary point x G tt(M). Since M is maximal, the set Mfl ({x} x R m ) is open relative 
to gphF n ({x} x R m ). Furthermore, since dimpM v = I for each vector v G F(x), it easily follows that 
dimMn ({x} x R m ) = I. 

We now claim that dim.7r(Af) = k. Indeed suppose this is not the case, that is the strict inequality 
dim7r(M) < k holds. Since dimdomF^ = k, we deduce that there exists a sequence Xi — ¥ x, with 
Xi G domi 7, and Xi ^ tt(M) for each index i. Since F is inner-semicontinuous on M, we deduce 

M n ({x} x R m ) C Limsup {x t } x F(x t ), 

i— >oc 

which contradicts maximality of M. Consequently, using Proposition 15.1(71 we deduce dimAf = k + I. 
Since M was an arbitrary maximal stratum, the result follows by an application of Proposition [^Hl D 
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5.2 Local dimension of semi-algebraic subdifferential graphs. 

We begin with a definition. 

Definition 5.12 (Subjets). For a function /: R™ — > R, the limiting subjet is given by 

[df] :={(x,f(x),v);vedf(x)}. 
Subjets corresponding to the other subdifferentials are dehned analogously. 

Much like /-attentive convergence, subjets are useful for keeping track of variational information in 
absence of continuity. In this section, we build on the following theorem. This result and its consequences 
for generic semi-algebraic optimization problems are discussed extensively in 

Theorem 5.13. Theorem 3.6] Let /: R" — > R be a proper semi- algebraic function. Then the subjets 
[dpf], [df], [df] and [d c f] have dimension exactly n. 

An immediate question arises: Can the four subjets have local dimension smaller than n at some of 
their points? In a recent paper [10] . the authors showed that this indeed may easily happen for [d c f]- 
Remarkably the authors showed that the subjets [dpf], [df], and [df] of a lower-semicontinuous, semi- 
algebraic function / : R™ — > R do have uniform local dimension n. The significance of this result and 
the relation to Minty's theorem were also discussed. In this section, we provide a much simplified proof 
of this rather striking fact (Theorem 15.16]) . The main tool we use is the following accessibility lemma, 
which is a special case of Lemma 13.51 Since the proof is much simpler than that of Lemmma 13.51 we 
include the full argument below. 

Lemma 5.14 (Accessibility). Consider a closed set Q C R™, a manifold M C Q, and a point x £ M . 
Recall that the inclusion Nq(x) C Nm{x) holds. Suppose that a proximal normal vector v £ Nq(x) lies 
in the boundary of Nq(x), relative to the linear space Nm(x). Then there exist sequences Xi — > x and 
Vi — > v, with Vi £ Nq(xi), and so that all the points Xi lie outside of M. 

Proof. There exists a real number A > so that x + Xv lies in the prox-normal neighborhood W of M at 
x and such that the equality Pq{x + Xv) — x holds. Consider any sequence Vi £ R" satisfying 

Vi -^v,Vi£ N M {x),v l £ Nq(x). 

Choose arbitrary points Xi £ Pq(x + Xvi). We have 

(x - x^ + Xvi £ Nq(xi). 

We deduce Xi ^ x. Clearly, the sequence Xi converges to x. We claim xt ^ M for all sufficiently 
large indices i. Indeed, if it were otherwise, then for large i, the points x + Xvi would lie in W and 
we would have x% £ Pm(x + Xvi) — x, which is a contradiction. Thus we have obtained a sequence 
(xi, j(x — Xi) + Vi) £ gphTVg, with Xi ^ M, and satisfying (xi, j(x — Xi) + Vi)) — > (x,v). □ 

The following is now immediate. 

Corollary 5.15. Consider a lower semicontinuous function f : R™ — > R, a manifold M C R™, and a 
point x £ M. Suppose that f is smooth on M and the strict inequality dmidp f(x) < dhaNu{x) holds. 
Then for every vector v £ dpf{x), there exist sequences {x%, f(xi),Vi) — > (x,f(x),v), with Vi £ dpf{xi), 
and so that all the points Xi lie outside of M . 

Proof. From the strict inequality, one can easily see that the normal con e ^ {x , f(x)) has empty interior 
relative to the normal space Af gp h (x, f{x)). An application of Lemma T5 . 141 completes the proof. □ 

We can now prove the main result of this section. 

Theorem 5.16. Consider a lower-semicontinuous, semi- algebraic function f : R" — > R. Then the subjets 
[dpf], [df], and [df] have constant local dimension n around each of their points. 
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Proof. We first prove the theorem for the subjet [dpf]. Consider the semi-algebraic set- valued mapping 



A I 



F(x) := {f(x)} x d P f(x), 

whose graph is precisely [dpf]. By Propositions l5.3l and l5.91 we may stratify the domain of F into finitely 
many semi-algebraic manifolds {Mi}, so that on each stratum M,, the mapping F is inner-semicontinuous, 
the images F(x) have constant dimension, and / is smooth. Consider a triple (x,f(x),v) <G [dpf]. We 
prove the theorem by induction on the dimension of the strata M in which the point x lies. Clearly the 
result holds for the strata of dimension n, if there are any. As an inductive hypothesis, assume that the 
theorem holds for all points (x, f(x),v) £ [dpf] with x lying in strata of dimension at least k, for some 
integer k > 1. 

Now consider a stratum M of dimension k — 1 and a point x € M. If dim_F(a;) = n — dimM, then 

recalling that F is inner-semicontinuous on M and applying Proposition l5 . 1 il we see that the set gph F 

has local dimension n around (x, f(x),v) for any v € dpf(x). The result follows in this case. 

Now suppose dim_F(a;) < n — dimM. Then, by Corollary 15.151 for such a vector v, there exists 
a sequence (xi, f(x{), v{) — ¥ (x, f(x),v) satisfying (a;*, f(xi),Vi) G [dpf] an d Xi $ M for each index 
i. Restricting to a subsequence, we may assume that all the points Xi lie in a stratum K satisfying 
dim K > k. By the inductive hypothesis, we deduce 

d im [d P f]( x , f(x),v) > \imsup dim[ dp f](xi, f(xi),Vi) = n. 

i— >oo 

This completes the proof of the inductive step and of the theorem for the subjet [dpf]. 

Now observe that [dpf] is dense in [df] and in [df]. It follows that [df] and [df] also have local 
dimension n around each of their points. □ 

Surprisingly Theorem 15. 161 may fail in the Clarke case, even for Lipschitz continuous functions. 

Example 5.17. Consider the function /: R 3 — ► R, defined by 

!min{x, y, z 2 } if (x, y, z) e R^ 
mm{-a;, -y, z 2 } if (x, y, z) e R ; I 
otherwise. 

Let x £ R™ be the origin and let T := conv {(1, 0, 0), (0, 1, 0), (0, 0, 0)}. One can check that the local 
dimension of gph<9 c / at (x, v) is two for any vector v £ (conv (T U — T)J \ (T U — T). For more details see 
[TO| Example 3.11]. 
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